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ABSTRACT 



Eesides the ubiquitous Pearson product-moment r, 
there are a number ct ether measures cf relationship that are 
attenuated by errors of measurement and tor which the relationship 
between true neasures can be estimated. Among these are the 
correlation ratio (eta squared), Kelley's unbiased correlation ratio 
(epsilon squared). Hays' cmega squared, and the intraclass 
coefficient of correlation, expressed as a ratio of variance 
components by Ronald Fisher. This paper shows how tc correct each of 
these for attenuation. Such corrections permit estimates of 
relationships between true scores to be made when estimates of 
relevant reliability coefficients are available. (Author) 
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As Miss Finucci indicated, for two-thirds of a century measurement spe- 
cialists have been concerned with correcting the Pearson product-moment coef- 
ficient of correlation for diminution due to errors of measurement. How 
errors of measurement affect other simple measures of co-relationship does 
not seem to have been studied as thoroughly. At the risk of repeating result 0 
that may already be in the literature somewhere, though unknown to me and 
probably to you, Mr. Livingston and I will provide the rationale and results 
for several correlational statistics. These are eta, epsilon, omega, and 
Intraclass r . 

Let us begin with the venerable correlation ratio, eta squared. In 
analysis of variance language for a one-way classification it may be expressed 
as the ratio of the stun of squares between groups to the total sum of squares. 

Your handout shows the abstract layout. Eta has perhaps been most use- 
ful recently for studying the clustering of groups in survey sampling, where 
group me-.vL.orn)tlp constitutes a nominal classification. There may be any num- 
ber of groups from 2 upward. How great are differences among groups on a 
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dependent variable • compared with differences within groups? Errors of clas- 
sification may occur, as for example when an attorney is misclassified as a 
physician, but these are not errors of measurement in the classical sense. 

In this paper we deal only with the attenuating Influence of errors of meas- 
urement in the dependent variable. We do not consider correlation ratios for 
those situations where one may compute two different etas, one for Y on X 
and the other for X on Y. 

The method and results are shown in the handout. We start with eta 

squared and give it rather detailed treatment. Eta squared is defined in 

formula (1). There the SS’s are sums of squares from an analysis of variance. 

The correction for attenuating errors of measurement in Y is given in 

formula (6). The first term there is eta squared divided by the reliability 

2 

coefficient of the Y's , which is the same correction one makes in r . 

However, there is a non-negative subtraction term that becomes 0 only for 

infinite N or for perfect reliability of the Y’s . This reduction term 

seems required because the chance value of eta squared is not zero, as it is 
2 

for r , but Instead is capital 6 minus one, divided by capital N minus 
one. Thus, for finite sample size the range of possible non-chance etas is 
less than the range of r r s . 

Formula (7) shows Kelley's so-called unbiased correlation ratio, epsilon 
squared, as a function of eta squared. Its attenuation due to errors of meas- 
urement in the Y's is caused by attenuation in eta squared. 

Hays' ratio, which he called omega squared, is shown in formula (8). It 
differs from epsilon squared only by having the mean-sauared-within in its 
denominator. This of course causes it to yield lower values than epsilon 
squared. The attenuating effects of errors of measurement tend to affect its 
numerator little, but they can greatly Increase its denominator. 



Eta, epsilon, and omega are closely related, as Glass and Hakstian 
showed splendidly in their 1969 AERJ article that won for them the Palmer 0* 
Johnson Memorial Award they received last night. The fourth correlation- 
ratio which we consider * intraclass r , is similar in appearance. Compare 
formulas (8) and (10). Omega squared and intraclass r have the same numer- 
ator and the same mean squares in their denominators , but the coefficients 
of the mean-squares-within is considerably different in the two denominators. 
Intraclass r has a rather different rationale from eta, epsilon, and omega. 
As explained in the handout, it is based on a ratio of variance components 
in the situation where levels of the ANOVA factor have been sampled randomly 
from a target population of such levels. 

Intraclass r is especially useful when, for example, one wishes to 
study the correlation of the heights of brothers. “Family 1 ' then constitutes 
the ANOVA factor. A sample of capital G families is drawn randomly from a 
large population of families to which one wishes to generalize. Then the 
heights of the brothers in the sample are obtained, and an analysis of vari- 
ance is performed. The variance of the means of all families in the popula- 
tion is estimated. This estimated variance component is then divided by 
itself plus the estimated within- group variance, as shown in the first part 
of formula (9) , 

The numerator of formula (10) is affected little by errors of measure- 
ment, whereas the denominator can be increased considerably by them. However, 
intraclass r would seem to be less attenuated by errors of measurement in 
the Y's than epsilon squared or omega squared are. One must remember, 
though, that these three statistics do not have the same value for a given 
set of data* Intraclass r will be largest, and omega squared smallest. 
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In this brief paper and handout we have merely scratched the surface of 
an important and seemingly neglected area. Perhaps it has had little atten- 
tion from educational researchers and psychologists because analytical survey 
sampling has been the province mainly of sociologists. Survey researchers 
have not;, until recently, seemed much interested in errors of measurement. 
There are signs that this may be changing. 

Persons who delve into this area will quickly discern problems that we 
have only implied in our presentation. Among these are unbiasedness versus 
maximum likelihood estimation; computation of appropriate reliability coeffi- 
cients, correction for errors of measurement in the X variable, and the 
influence of technical errors in setting factor levels. For some surveys 
misclassification may be an important cause of lowered correlation ratios. 

For other surveys or experiments, imprecise determination of the factor 
levels, particularly of an ordered variable, may have serious consequences. 

Most of the techniques and procedures needed to further the study of 
correlation ratios already exist. There are unsolved problems enough for at 
least one worthwhile doctoral dissertation. 
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Handout for paper by Julian C. Stanley and Samuel A. 
Livingston entitled "Correcting Four Similar Correla- 
tional Measures for Attenuation Due to Errors of Meas- 
urement in the Dependent Variable,*' presented on 5 Feb- 
ruary 1971 as one of the five papers of a symposium 
entitled "Some Attenuating Effects of Errors of Measure- 
ment," American Educational Research Association con- 
vention.. New York City. 
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Define the correlation ratio as 
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Take its expectation over an Infinite number of random samples of N obser- 
vations each: 
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Analytically, the approximation seems fairly good if N Is moderately large. 
For error-free Y measures one has 
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where T designates true scores on the dependent (l.e., Y) variable. 
Formula (3) may be rewritten as 



(3) 
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because of the familiar model of classical test-score theory that produces the 
2 2 2 2 

relationship °y 0 °T + °e * °e t * ia var *- ance measurement errors. 

A computing form of (4) Is 
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(5) 



where a^ = MS tot (l - p^ Y ) ** MS w (l - p* y ) . P Y Y is the overall reliability 
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coefficient 

of the Y ? s 

See Stanley 
Formula 
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of the Y’s » P T y is the within-groups reliability coefficient 



2 2 

. Via classical test-score theory it is assumed that O = a 

0 6 

t W 

(1971). 

(5) may also be written as 
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This makes clear that, when reliability of the Y's is perfect (i.e., 

"ijl ’ » • n T y X ■ ^X • 

Kelley's (1935) Epsilon Squared 



a 2 *2 a 2 g - i g — l 

V.X > n YX when Tiy^ > — — , because — - j - is the chance value of 
X 
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Tlyjj • This is the '‘bias" that Kelley tried to remove via his epsilon squared 



(see Glass and Hakstian, 1969); 
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(7) 



When MS w » MS^ , epsilon squared becomes 0 . 

2 2 
Correcting £ for attenuation merely Involves correcting the n in 

its formula via formula (5) or (6), above, or directly by subtracting the 

variance error of measurement from numerator and denominator of (7). 
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Hay's (1963) Omega Squared 



Hays ! co 2 
Hakstian, 1969, 
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differs from £ in t.tie denominator only (see Glass and 
formulas 13 and 14), so it Is also "unbiased.’" 
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If one merely subtracts O from MS. . and also from MS in the numera- 

e tot w 

tor, it is unaffected. (Of course, MS. . - MS., is somewhat affected by 

tot w 

errors of measurement, though not systematically.) The denominator, however, 

~2 2 
will be reduced by NO g , and hence the corrected value may greatly exceed w 



The Intraclass Coefficient of Correlation 

Intraclass p (Harris, 1913; Fisher, 1925) is defined in random-effects- 
model ANOVA components of variance as follows: 
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^ ~ n 2 = • • • = tig = n * N/G , then formulajj(S) reduces to 
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dance for a random-effects 
d-effects situation only. 



Thus, though its numerator is the same as that? for eta squared and 

! 

epsilon squared, its denominator is smaller. This ijoccurs because of the way 
that Pi ntrac i ass is defined via components of vai 

ANOVA. The other three statistics are for the fix* 

i 

Because errors of measurement apply to repeated testing of the same examinees 
however, rather than to sampling fluctuations, thef correction for attenuation 
procedure is similar. A random sample of examinee’s would receive a random 
sample of factor levels. Repeated testing (perhais conceptual rather than 
actual) would be with the same examinees but with/ different factor levels. 
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